Approximation to real numbers by cubic algebraic integers II 



by Damien ROY 

Abstract. It has been conjectured for some time that, for any integer n > 2, any real 
number e > and any transcendental real number £, there would exist infinitely many 
algebraic integers a of degree at most n with the property that |f — a| < H(a)~ n+e , where 
H(a) denotes the height of a. Although this is true for n = 2, we show here that, for 
n = 3, the optimal exponent of approximation is not 3 but (3 + \/5)/2 ~ 2.618. 

1. Introduction 

Define the height H(a) of an algebraic number a as the largest absolute value of 
the coefficients of its irreducible polynomial over Z. Thanks to work of H. Davenport 
and W. M. Schmidt, we know that, for any real number £ which is neither rational nor 
quadratic over Q, there exists a constant c > such that the inequality 



|f - a\ < cH(a) 



2 

"7 



where 7 = (1 + \/5)/2 denotes the golden ratio, has infinitely many solutions in algebraic 
integers a of degree at most 3 over Q (see Theorem 1 of [3]). The purpose of this paper is 
to show that the exponent 7 2 in this statement is best possible. 

Theorem 1.1. There exists a reai number f which is transcendental over Q and a constant 
ci > such that, for any algebraic integer a of degree at most 3 over Q, we have 



|f - a\ > aH(a) 



-7 2 



In general, for a positive integer n, denote by r n the supremum of all real numbers r 
with the property that any transcendental real number £ admits infinitely many approxi- 
mations by algebraic integers a of degree at most n over Q with |f — a\ < H(a)~ T . Then, 
the above result, shows that T3 = ■y 2 ~ 2.618 against the natural conjecture that r n = n 
for all n > 2 (see [7], page 259). Since r 2 = 2 (see the introduction of [3]), it leaves open 
the problem of evaluating r n for n > 4. At present the best known estimates valid for 
general n > 2 are 

\(n + l)/2] <r n <n 

where the upper bound comes from standard metrical considerations, while the lower 
bound, due to M. Laurent [4], refines, for even integers n, the preceding lower bound 
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T~n > L( n + 1)/2J of Davenport and Schmidt [3]. Note that similar estimates are known for 
the analog problem of approximation by algebraic numbers, but in this case the optimal 
exponent is known only for n < 2 (see [2]). 

In the next section we recall the results that we will need from [6]. Then, in Section 3, 
we present the class of real numbers for which we will prove, in Section 4, that they satisfy 
the measure of approximation of Theorem 1.1. Section 3 also provides explicit examples 
of such numbers based on the Fibonacci continued fractions of [5] and [6] (a special case 
of the Sturmian continued fractions of [1]). 

2. Extremal real numbers 

The arguments of Davenport and Schmidt in Section 2 of [3] show that, if a real 
number £ is not algebraic over Q of degree at most 2 and has the property stated in 
Theorem 1.1, then there exists another constant c 2 > such that the inequalities 

l<\x \<X, \x t-x 1 \<c 2 X- 1 /~ / , \x £ 2 -x 2 \ <c 2 X~ 1 ^, (2.1) 

have a solution in integers xq, xi, x 2 for any real number X > 1. In [6], we defined a real 
number £ to be extremal if it is not algebraic over Q of degree at most 2 and satisfies 
the latter property of simultaneous approximation. We showed that such numbers exist 
and form an enumerable set. Thus, candidates for Theorem 1.1 have to be extremal real 
numbers. 

For each x = (xo, x\, x 2 ) G Z 3 and each £ e R, we define 

||x|| = max{|:ro|, \x 2 \\ and L^(x) = max{|xo£ — xi\, \xo^ 2 — x 2 \}. 

Identifying x with the symmetric matrix 

fx Xx\ 
\Xl x 2 J ' 

we also define 

det(x) = xqx 2 — x\. 

Then, Theorem 5.1 of [6] provides the following characterization of extremal real numbers. 

Proposition 2.1. A real number £ is extremal if and only if there exists a constant cs > 1 
and an unbounded sequence of non-zero points (x.k)k>i of Z 3 satisfying, for all k > 1, 

(i) Cg^lxfcp < ||x fc+ i|| < c 3 ||x fc || 7 , 

(ii) eg ^IxfcH" 1 < L c (x fc ) < csllxfcH" 1 , 
(in) 1 < | det(x fc )| < c 3 , 

(iv) 1 < |det(x fc ,x fc+ i,x fc+2 )| < c 3 . 

In order to prove our main Theorem 1.1, we will also need the following special case 
of Proposition 9.1 of [6] where, for a real number £, the symbol {t} denotes the distance 
from £ to a closest integer: 
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Proposition 2.2. Let £ be an extremal real number and let (x.k)k>i be as in Proposition 
2.1. Assume that, upon writing = (xk,o, Xk,i, Xk,2), there exists a constant C4 > such 
that 



for all k > 1. Then, for any algebraic integer a of degree at most 3 over Q, we have 



for some constant C5 > 0. 

Since extremal real numbers are transcendental over Q (see §5 of [6]), this reduces 
the proof of Theorem 1.1 to finding extremal real numbers satisfying the hypotheses of the 
above proposition. Note that, for an extremal real number £ and a corresponding sequence 
(xfc)fc>i, Proposition 9.2 of [6] shows that there exists a constant c 6 > such that 



for any sufficiently large k. 

We also mention the following direct consequence of Corollary 5.2 of [6]: 

Proposition 2.3. Let £ be an extremal real number and let ('x.k)k>i be as in Proposition 
2.1. Then there exists an integer ko > 1 and a 2 x 2 matrix M with integral coefficients 
such that, viewing each symmetric matrix, the point X&+2 is a rational multiple of 

Xfc+iMxfc when k > ko is odd, and a rational multiple of Xjt+i t Mx/ c when k > ko is even. 

Proof. Corollary 5.2 together with formula (2.2) of [6] show that there exists an integer 
ko > 1 such that x^+2 is a rational multiple of Xjt+ix^^Xfc+i for all k > ko- If S is a 
2x2 matrix such that x^+i is a rational multiple of x^Xfe-i for some k > ko, this implies 
that Xfc + 2 is a rational multiple of XfeSx^+i and thus, by taking transpose, that x^+2 is 
a rational multiple of x^i^x^. The conclusion then follows by induction on k, upon 
choosing M so that the required property holds for k = ko- Q 

Note that, in the case where all points x^ have determinant 1, one may assume that 
M G GL 2 (Z) in the above proposition and the conclusion then becomes x.k+i = ix/c+i'Sxfc 
where £ is either M or l M depending on the parity of k > ko- This motivates the following 
definition: 

Definition 2.4. Let M E GL 2 (Z) be a non-symmetric matrix. We denote by £(M) the set 
of extremal real numbers £ with the following property. There exists a sequence of points 
(*-k)k>i in Z 3 satisfying the conditions of Proposition 2.1 which, viewed as symmetric 
matrices, belong to GL 2 (Z) and verify the recurrence relation 



|£-a| > c 5 H(a) 
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{^fc,o£ 3 } > c 6 ||x fc 
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Examples of extremal real numbers are the Fibonacci continued fractions £ a) b (see [5] 
and §6 of [6]) where a and b denote distinct positive integers. They are defined as the real 
numbers 

Ca,b = [0, a, 6, a, a, 6, . . .] = l/(a + 1/(6 +•••)) 

whose sequence of partial quotients begins with followed by the elements of the Fibonacci 
word on {a, &}, the infinite word abaab- ■ ■ starting with a which is a fixed point of the 
substitution a i— > ab and 5 h a. Corollary 6.3 of [6] then shows that such a number 
belongs to S(M) with 



(2.5) M 



a lWft l\ _ ( ab+1 a 
1 j V 1 j ~ V 6 1 



We conclude this section with the following result. 

Lemma 2.6. Assume that £ belongs to S(M) for some non-symmetric matrix 

and let (x.k)k>i be as in Definition 2.4. Then, upon writing = (xk,o, Xk,i,Xk,2), we have, 
for all k>2, 

(i) x fc+ 2 = (ax k ,o + (b + c)x k ,i + dx k ,2)^k+i ± Xfc-i, 

(ii) Xk,oXk+i,2 - x k ,2Xk+i,o = ±(ax k -i,o - dxk-1,2) ± (6 - c)x k -i,i 

Proof. For k > 1, we have 

x fc+ i = XfcSxfc.i and x fc+2 = x fc+ i*£x fc 
where 5 is M or *M according to whether is even or odd, and so 

x/c+2 = *Xfc+2 = XfcS'xfc+i = (x fc 5') 2 x fc _i. 
Since Cayley-Hamilton's theorem gives 

(xfeS 1 ) 2 = trace(x/ c 5')xfc5' — det(x/ c 5')/, 

we deduce 

x fc+2 = trace(x fc 5 , )x/ £+ i - det(xfc5 , )x fc _i 

which proves (i). Finally, (ii) follows from the fact that the left hand side of this inequality 
is the sum of the coefficients outside of the diagonal of the product 



x/jJxfc_|_i where J 



1 

-1 or 



and that, since Jx fc J = ±x fc , we have 

X fc Jx fc+ i = iJx^Xfc+i = ±JSx k -i- 

3. A smaller class of real numbers 

Although we expect that all extremal real numbers £ satisfy a measure of approxima- 
tion by algebraic integers of degree at most 3 which is close to that of Theorem 1.1, say 
with exponent 7 2 + e for any e > 0, we could only prove in [6] that they satisfy a measure 
with exponent (3/2)7 2 (see Theorem 1.5 of [6]). Here we observe that the formulas of 
Lemma 2.6 show a particularly simple arithmetic for the elements £ of £(M) when, in the 
notation of this lemma, the matrix M has b = 1, c = — 1 and d = 0. Taking advantage of 
this, we will prove: 

Theorem 3.1. Let a be a positive integer. Then, any element £ of 

«•=*(-! I) 
satisfies the measure of approximation of Theorem 1.1. 

The proof of this result will be given in the next section. Below, we simply show that, 
for a = 1, the corresponding set of extremal real numbers is not empty. 

Proposition 3.2. Let m be a positive integer. Then, the real number 

T) = (m + 1 + i m ,m+2)~ 1 = [0, 777, + 1 , m, 777 + 2, 777, 777, 777, + 2, . . .] 

belongs to the set Si . 

Proof. We first note that, if a real number £ belongs to £{M) for some non-symmetric 
matrix M G GL2(Z) with corresponding sequence of symmetric matrices (xfc)fc>i, and if 
C is any element of GL/2(Z), then the real number r\ for which (77, — 1) is proportional 
to (£, — 1)C belongs to £{ l CMC) with corresponding sequence (C _1 Xfc t C _1 )fc>i. The 
conclusion then follows since £, m ,m+2 belongs to £{M) where M is given by (2.5) with 
a = m and b = m + 2 and since 

Remark. In fact, it can be shown that £ a is not empty for any integer a > 1. For 
example, consider the sequence of matrices (x.k)k>i defined recursively using the formula 
of Definition 2.4 with 

(\ 1\ / a 3 + 2a a 3 -a 2 + 2a-l\ , fa l\ 

Xl = ^l oj' X2= U 3 "« 2 + 2a-l a 3 -2a 2 + 3a- 2 J and M= ^-l oj' 
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Then, using similar arguments as in Section 6 of [6], it can be shown that (~x.k)k>i is a 
sequence of symmetric matrices in GL^Z) which satisfies the four conditions of Proposition 
2.2 for some real number £ which therefore belongs to S a . 

4. Proof of Theorem 3.1 

We fix a positive integer a, a real number £ e S a , and a corresponding sequence of 
points (xfc)fc>i of Z 3 as in Definition 2.4. For simplicity, we also define 

X k = ||x fc || and 4 = {^fc,2£}, (A; > 1). 

The constant C3 being as in Proposition 2.1, we first note that 

{xk,i£,} < \xk,i£ - x k ,oC 2 \ + \xk,o€ 2 ~ XkM < + l)c 3 X A T 1 . 
For k > 2, the recurrence formula of Lemma 2.6 (i) implies 

(4.2) = ax kj0 x k +i,2 ± 

and Lemma 2.6 (ii) gives 

Using (4.1), the latter relation leads to the estimate 

{x k , x k+1}2 £,} < X k {x k +i,o€} + a{x k -i,o€} + 2{x fc _i 5 i£} < c 7 X~\ 

for some constant C7 > (since X k X^ x < c 3 +7 X^ 1 by virtue of Proposition 2.1 (i)). 
Combining this with (4.2), we deduce 

|4+2 - 4-i I < a{x kj0 x k+lj2 £,} < aorX^ v 

Since the sequence (X k ) k >i grows at least geometrically, this in turn implies that, for any 
pair of integers j and k which are congruent modulo 3 with j > k > 1, we have 

\Sj — Sk\ < c-sX^ 1 

with some other constant eg > 0. Since 

\{xk,oC 3 } -S k \< \xk,o£ 3 - XkM\ < C3|£|^ fc _1 , (k > 1), 
we conclude that, for % = 1, 2, 3, the limit 

0i = lim {x i+3j)0 £ 3 } = lim 5 l+3j 

j— >oo j— >oo 

exists and that 

1^ - {x fc ,o€ 3 }| < (c 8 + calCI)^ 1 
for k = i mod 3. Since, for all sufficiently large fc, Proposition 9.2 of [6] gives 

with a constant eg > 0, these numbers 9i are non-zero. Thus the sequence {{ x k,o£ 3 }) k>1 
has (at most three) non-zero accumulation points and therefore is bounded below by some 
positive constant, say for k > ko, to exclude the finitely many indices k where x kt o = 
0. Applying Proposition 2.2 to the subsequence ( , x. k ) k > ko , we conclude that £ has the 
approximation property stated in Theorem 1.1. 
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